SPEECHDAT-CAR. A Large Speech Database for Automotive Environments

Authors

  • Asunción Moreno
  • Børge Lindberg
  • Christoph Draxler
  • Gaël Richard
  • Khalid Choukri
  • Stephan Euler
  • Jeffrey Allen
Abstract

The aim of the SpeechDat-Car project is to develop a set of speech databases to support training and testing of multilingual speech recognition applications in the car environment. As a result, a total of ten (10) equivalent and similar resources will be created. The 10 languages are Danish, … For each language, 600 sessions will be recorded (from at least 300 speakers) in seven characteristic environments (low speed, high speed with audio equipment on, etc.). This paper gives an overview of the project with a focus on the production phases (recording platforms, speaker recruitment, annotation and distribution).

Automatic speech recognition (ASR) appears to be a particularly well-adapted technology for providing hands-free, voice-based interfaces that will enable new in-car applications to develop while taking care of safety aspects. However, the car environment is known to be particularly noisy (street noise, car engine noise, vibration noise, babble noise, etc.). To obtain optimal speech recognition performance, it is necessary to train the system on large corpora of speech data recorded in context (i.e. directly in the car). For this reason, language-specific initiatives for database collection have been under way since about 1990 [Langmann (1998)]. The European project SpeechDat-Car aims at providing a set of uniform, coherent databases for nine European languages and for American English. (SpeechDat-Car started in April 1998 in the 4th EC framework under project code LE4-8334, with a project duration of 30 months.) … in developing large-scale speech resources for a wide range of languages and for in-car applications, participation of external partners to the original consortium is also possible. Siemens is an 'external' partner. It is also important to note that SpeechDat-Car commits itself to a strict validation protocol to ensure optimal quality and exchangeability of the databases [Van den Heuvel (1999)].
This paper gives an overview of the project with a focus on the production phases. It is organised as follows: the next section describes the database specifications (database content, recording platforms and validation procedures). Section 3 provides additional information on speaker recruitment, and Section 4 gives an extensive description of the annotation procedure and tools. The paper concludes with a short section about database availability and dissemination.

Each database produced in the SpeechDat-Car project is intended to provide enough data to adapt speaker-independent recognition systems to the automotive environment. Database contents were designed to cope with different applications. The design …


Similar resources

Speechdat-car: towards a Collection of Speech Databases for Automotive Environments

The SpeechDat-Car project is a 4th framework EC project in the Language Engineering programme. It aims at collecting a set of nine speech databases to support training and testing of robust multilingual speech recognition for in-car applications. The consortium participants are car manufacturers, telephone communications providers, and universities. This paper describes the background of the pr...


Speechdat-car: Speech Databases for Voice Driven Teleservices and Control of In-car Applications

The SpeechDat-Car project, part of the 4th framework of the European Community's Language Engineering Programme, started in April 1998 with a duration of 30 months. It is a common initiative of car manufacturers, telephone communications operators, companies active in voice-operated services and universities that aims at collecting a set of speech databases in nine different languages to suppo...


First experiences of the German speechdat-car database collection in mobile environments

In SpeechDat-Car, speech databases for speech driven devices and services for mobile environments are collected for nine European languages. The German SpeechDat-Car installation was the first fully equipped platform within the project. It has served as a testbed for the recording software for the entire project, and as an opportunity to perform technical and organizational feasibility tests fo...


SpeechDat-Car Fixed Platform

SpeechDat-Car aims to develop a set of speech databases to support training and testing of multilingual speech recognition applications in the car environment. The database is composed of two types of recordings. The first type consists of wideband audio signals recorded directly in the car, while the second type is composed of GSM signals transmitted from the car and recorded simultaneously in a far-en...


The speechdat-car multilingual speech databases for in-car applications: some first validation results

The main objective of SpeechDat-Car is to develop a set of speech databases to support training and testing of multilingual speech recognition applications in the car environment. SpeechDat-Car started in April 1998 in the 4th EC framework under project code LE4-8334. The duration of the project is 30 months. Equivalent and similar resources for nine languages will be created: Danish, English, ...



Publication date: 2000